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CENTRALIZED DATA POSITION INFORMATION STORAGE SYSTEM AND 

METHOD 

Field of the Invention 

[0001] The present invention relates to a method of and apparatus for 
storing data position information relating to data distributed on a tape data 
storage device. 

Background of the Invention 

[0002] In order to store information it is known to use a magnetic tape 
storage system in which a magnetic tape is wound on reels or spools. These 
magnetic tape information storage systems are extensively used to back up, 
archive and store data for future use by a user of, for example, a host user 
interface or PC. 

[0003] A typical magnetic tape storage device 100 is illustrated in Fig. 
1 . The tape storage device 100 may be a stand alone unit, or may be integrated 
within a casing of a host computer entity 101. The data storage device 100 is 
operable to (1 ) receive data from a host computer entity 101 and store data on a 
magnetic tape data storage medium, contained within a tape cartridge, (2) read 
data from cartridges, and (3) input read data to the host computer 101 . 

[0004] Fig. 2 is a perspective external view of a typical known tape 
data storage cartridge 200, comprising a casing 201 containing a single reel 
upon which is wound a length of magnetic tape data storage medium. The 
magnetic tape data storage medium is wound in and out of the cartridge through 
an aperture 202, onto a reel within the tape data storage device in use. 

[0005] Fig. 3 is a schematic front view of cartridge 200 inserted into 
a tape data storage device 100, wherein a length of tape data storage medium 
300 is wound from an internal reel 301 of the cartridge, through a series of 
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capstans and rollers 302-305 onto a second reel 306 in the tape data storage 
device. The tape data storage device comprises a read/write head 307 over 
which the tape 300 is drawn, in forward and reverse directions, to apply read or 
write operations of user data to the tape. 

[0006] Typically, the tape drive controls the movement of the tape over 
the write head to record data onto the magnetic tape, and over the read head to 
generate an electrical signal from which the stored data can be reconstructed. 
Commonly, the read and write heads combined into a single read/write head 
controlled by the tape drive. 

[0007] A length of magnetic tape 300 as known in the art is 
schematically illustrated in Fig. 4 such that data is recorded on the tape in a 
series of parallel, elongated (i.e., linear) data tracks 400. When a command is 
issued by host computer 101 to read specific target data on the tape the tape 
drive using a read head must scan the data tracks 400 to locate the position of 
the target data, thereby allowing the read head to retrieve the data and transfer it 
back to the host. The tape drive, being configured to ascertain a current position 
on the tape relative to the Beginning Of Tape or Wrap (BOW), scans the data 
tracks 400 until the read head passes over a directory 401 positioned at the 
BOW. In the example of Fig. 4, the directory 401 is located at the BOW and 
configured such that the directory contents are distributed along the tape, at the 
end of each Wrap. The contents of the directory 401 coincide with the separate 
data tracks 400, such that the contents of the directory 401 located at the BOW 
or the EOW are used to allow the tape drive to determine if the target data is 
contained on a relevant data track. If data position information in the area 401 
indicates that the target data is not located in a particular data track, then the 
drive must continue reading the data until the drive comes across the target data. 
Similar prior art data storage systems utilize a directory stored in a cartridge 
memory rather than on the tape, and the data are accessed on tape by a read 
operation of the cartridge memory. 
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[0008] When recording data onto a magnetic tape it is known to 
partition the data into a plurality of data sets, such data sets being distributed 
across the various data tracks. The partitioning of data into data sets distributed 
across the data tracks provides a physical position of any one particular data set 
relative to, for example, the BOW and EOW. Such a physical positioning is 
provided because the data sets are spatially separated along the length of the 
tape. 

[0009] Within one particular data set the data is further partitioned into 
a series of records and filemarks, such partitioning giving rise to a logical data 
position for any particular record or filemark. 

[0010] Magnetic tape data storage systems known in the art having 
directories 401 associated with a corresponding data track 400 along the length 
of a tape, and utilize such directories to store logical data position information as 
detailed in Fig. 5. A data track 400 has data 500 distributed along its length. 
The data 500 within a data set and positioned on a data track 400 has 
corresponding data position information stored within the directory 401 as record 
data position information 501 and filemark data position information 502. 
Commonly, the directories also contain data position information 503 relating to 
the positioning of data sets distributed along the length of the data track 400. 

[0011] Fig. 6 is a schematic diagram of a tape. Fig. 6 is helpful in 
describing a typical mode of operation of a prior art magnetic tape data storage 
system that has received a target data command from a host. Using the 
directory 401 located at the BOW, and in particular the contents of the directory 
located at the BOW and EOW corresponding to a particular data track, the tape 
drive determines the tape motion required to position the read head on a data 
track corresponding to the data track where the target data is positioned. The 
directory 401 contains information at the BOW and the EOW. The information 
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indicates the positioning of the target data within this particular data track. As the 
read head is effectively transported to a new data track as shown in Fig. 6 the 
tape motion may be such that the read head moves away from the target data 
position B. The read head must serially read the data track until a system 
associated with the tape determines, using the contents of the directory 
information at the EOW (as shown in Fig. 6), that the tape motion should be 
reversed to access the target data B. Essentially, each data track information 
area 401 functions as a map of the logical position of the data to the actual 
physical position of the data on the tape. The physical position data is 
partitioned into data sets distributed along the data tracks. Obviously, the more 
information a directory, located at the BOW contains, the more complete the 
mapping of the physical position of data on tape. 

[0012] However, a prior art storage device having a complete directory 
(containing information relating to all data on the tape) is restricted by the read 
speed of the tape drive due to the serial reading operation as detailed in Fig. 6 
when the read head moves from, for example, point A 1 to point B where the 
target data are located. Moreover, a directional change of tape motion is 
undoubtedly required at some stage of the data retrieval process; the directional 
change adds to the data access time. 

[0013] Accordingly, the inventors have identified various problems 
associated with the employment and utilization of such magnetic tape data 
storage systems as described above and known in the art. The problems 
identified by the inventors include: 

Utilizing data position information in a distributed manner (the spatially 
separated directories 401) results in excess tape motion and, in particular, 
multidirectional changes. This excessive tape motion inevitably leads to a 
longer time being taken from the time a target data request is issued to the 
actual reading and retrieval of the target data. 
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Due to the operational nature of the magnetic tape system, target data 
searching within a particular data track 400 is undertaken at read speeds. 
The read speeds are dictated by the read head and the processing power 
of the tape drive. This constraint increases the above searching time 
period. An additional problem associated with the necessity to engage the 
read/write head is the excessive reading of data and data position 
information on data tracks in an attempt to locate the target data, as 
described in connection with Fig. 6. The magnetic tape, although generally 
long lasting, is subject to deterioration as a result of excessive tape motion 
and engagement by read/write heads. Excessive reading operations also 
reduce the lifetime of the read head. 

[0014] The inventors have recognized a need for a magnetic tape data 
storage system having a centralized data position information storage utility 
capable of storing data position information relating to data distributed along a 
linear tape. Such a system is capable of providing a data position information 
storage area that can be loaded and unloaded separately from the data stored 
on the tape. The method and apparatus of such a system having an accelerated 
target data searching utility compared to the prior art is disclosed in detail 
herein. 

Summary of the Invention 

[0015] The inventors, on recognizing the various problems associated 
with prior art data storage systems, provide a magnetic tape data storage system 
having a centralized data position information storage area that is effectively a 
pseudo map of the data distributed along the length of tape. By utilizing data 
position information stored centrally, and not distributed along the length of tape 
as found in the art, the inventors provide a robust data storage system capable of 
enhanced data access times following the receipt of a target data request from a 
host. 
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1*00161 A ccording to one embodiment of the present invention, 
following receipt of a target data request from a host, the centralized data 
position information storage area is accessed and the exact position of the target 
data, on the magnetic tape, is determined thereby necessitating a single tape 
displacement action to access this target data. This single tape displacement is 
in contrast to the operational nature of prior art storage systems. 

1*00171 Utilizing a centralized storage area provides a data storage 
system capable of loading and unloading data position information, for example, 
at the beginning and end of a data access session, respectively. According to 
further aspects of the present invention the centralized storage area is 
configurable for updating during data access operations such that at the end of a 
user session the updated centralized storage area can be downloaded to a 
suitable reserve storage area where it remains dormant until the centralized 
storage area is required by the storage device, at which time it is reloaded into 
the centralized storage area for use in determining a target data position on the 
tape. 

r00181 In another embodiment of the present invention the 
centralized data position information storage area is configurable to store data 
position information relating to selected groups of data distributed along the 
length of tape. This type of formatting provides a centralized storage area 
containing data position information relating to striped bands of data along the 
total length of the tape. This reduces the required capacity of the centralized 
storage area due to the reduced resolution of the pseudo data map. 

[0019] According to one aspect of the present invention a method of 
centralized data position information storage and utilization comprises the steps 
of: 

[0020] arranging a byte stream of data into partitioned logical data; 
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[0021] storing data position information relating to said logical data in a 
reserve storage area; 

[0022] transferring said information from said reserve storage area to 
a centralized storage area, wherein said centralized storage area is configured 
to store said information relating to substantially all said partitioned logical data; 
and 

[0023] utilizing said information in said centralized storage area to 
locate target data that is part of said logical data. 

[0024] According to another aspect of the present invention a method 
of storing and utilizing data position information on a tape data storage device, 
comprises the steps of: 

[0025] arranging a byte stream of data into partitioned logical data and 
recording said logical data onto a length of tape; 

[0026] storing data position information relating to said logical data in a 
reserve storage area; 

[0027] transferring said information from said reserve storage area to 
a centralized storage area located within said tape device, wherein said 
centralized storage area is configured to store said information relating to 
substantially all said partitioned logical data; and 

[0028] utilizing said information in said centralized storage area to 
locate target data on said tape, said target data being part of said logical data; 
and 
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[0029] reading said target data using a read head. 

[0030] According to a further aspect of the present invention a data 
position information storage and utilization device comprises: 

[0031] partitioned logical data distributed along a length of tape; 

[0032] a reserve storage area that stores data position information 
relating to said logical data; 

[0033] a centralized storage area that stores said information 
received from said reserve storage area, said centralized storage area storing 
information relating to substantially all said partitioned logical data; 

[0034] a search algorithm for determining the location of a target data 
on said tape; and 

[0035] a read head configured to read said logical data on said tape. 

[0036] The device is operable, in response to a request for said target 
data, to locate said target data on said tape using said information in said 
centralized storage area and to read said target data by using said read head. 

[0037] According to an added aspect of the present invention a 
computer product stores a program having program commands for causing a 
computer to perform the following steps: 

[0038] arranging a byte stream of data into partitioned logical data; 

[0039] storing data position information relating to said logical data in a 
reserve storage area; 
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10040] transferring said information from said reserve storage area to 

a centralized storage area configured to store said information relating to 
substantially all said partitioned logical data; and 

£0041] utilizing said information in said centralized storage area to 

locate target data that is part of said logical data. 

£0042] According to an added aspect of the present invention a 

computer product stores a program having commands for causing a comnntpr 
to store and JJtilize data position information on a tape data storage device . The 
program causes the computer to : 

£0043] arrange a byte stream of data into partitioned logical data and 

record said logical data onto a length of tape; 

£0044] store data position information relating to said logical data in a 

reserve storage area; 

£0045] transfer said information from said reserve storage area to a 

centralized storage area located within said tape device, wherein said centralized 
storage area is configured to store said information relating to substantially all 
said partitioned logical data; 

£0046] utilize said information in said centralized storage area to 

locate a target data on said tape, said target data being part of said logical data; 



and 



£00471 



read said target data using a read head. 



Brief Description of the Drawing 



30970032 US 



Substitute Specification 09/917,782 
1509-203 



-10- 

[0048] For a better understanding of the invention and to show how 
the same may be carried into effect, there will now be described by way of 
example only, specific embodiments, methods and processes according to the 
present invention with reference to the accompanying drawings in which: 

[0049] Fig. 1, as described, is an illustration of a prior art tape data 
storage device connected to a host; 

[0050] Fig. 2, as described, is a perspective view of a prior art tape 
data storage cartridge of a single reel type containing an elongate magnetic tape 
data storage medium; 

[0051] Fig. 3, as described, is a schematic diagram of a prior art tape 
data storage device for reading and writing data to a tape data storage medium 
inserted into the device; 

[0052] Fig. 4, as described, is a schematic diagram of a length of prior 
art magnetic tape configured to store logical data and logical data position 
information; 

[0053] Fig. 5, as described, is a schematic diagram of a prior art 
arrangement of data position information stored within a data storage device; 

[0054] Fig. 6, as described, is a schematic diagram of an operational 
mode of a prior art data storage device for accessing data following a data 
request from a host; 

[0055] Fig. 7 is a block diagram of tape recording system having a 
logical media with a centralized storage area and a physical pipeline according 
to one embodiment of the present invention; 
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[0056] Fig. 8 is an illustration of a data table of the centralized data 
position information storage area of Fig. 7; 

[0057] Fig. 9 is a flow diagram of the general operational nature of 
the data storage system of Fig. 7; 

[0058] Fig. 10 is a schematic diagram including the relationship 
between a start position and a data target in terms of a logical displacement and 
physical displacement on a tape; 

[0059] Fig. 11 is a schematic diagram including a length of magnetic 
tape partitioned into selected data groups; 

[0060] Figs. 12a-12c, together, are a flow diagram according to a 
specific method of operating the system of Fig. 7; and 

[0061] 

[0062] Figs. 13a-13d, together, are a flow diagram of a search 
algorithm for locating a data target according to a specific embodiment of the 
present invention. 



Detailed Description of the Drawing 

[0063] There will now be described by way of example the best mode 
contemplated by the inventors for carrying out the invention. In the following 
description numerous specific details are set forth in order to provide a thorough 
understanding of the present invention. It will be apparent however, to one skilled 
in the art, that the present invention can be practiced without limitation to these 
specific details. In other instances, well known methods and structures have not 
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been described in detail so as not to unnecessarily obscure the present 
disclosure. 



[0064] Fig. 7 includes a data formatting channel 700 responsive to 
data from a host device. Channel 700 comprises a logical data pipeline 701 
responsive to data from the host and operative to (1) perform logical formatting of 
user data and (2) compress user data. Pipeline 701 drives a host including 
logical media that in turn drives physical pipeline or channel 704. The user data 
from the host device, after passing through logical pipeline 701 , is partitioned into 
data set form. The compressed data is then transferred to logical media 702, 
configured with an updateable centralized data position information storage area 
703. Storage area 703, when constructed, is effectively a pseudo data position 
map of logical data stored on a length of magnetic tape. Storage area 703 
holds a logical representation of the data stored on tape, in the form of index 
data. 



[0065] The channel 700 is divided into a physical pipeline or channel 
704 that is operative to format the logical data into a physical format such that the 
data from channel 704 are written by write head 705 onto tape 706 as elongated 
parallel tracks, similar to tracks 400 (Fig. 4). The parallel elongated tracks 400 
written onto tape 706 are illustrated in Fig. 10 and can be written to a length of 
magnetic tape in a predetermined format. 

[0066] The components 701 , 702 and 704 can be firmware, e.g. an 
application specific integrated circuit, or code. 

[0067] Logical media 702 is a dynamic random access memory 
(DRAM), (i.e., a volatile storage area memory) such that information stored 
within the storage area may not be retained in the event of a termination of power 
to the logical media. 
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[0068] The logical data received from a host is formatted in the logical 
pipeline 701 to provide partitioned logical data. This partitioning is in terms of 
individual records and filemarks. The formatted logical data, being in data set 
form, is then passed through the logical media 702 and into the physical pipeline 
704. The physical pipeline 704 is configured to add error recovery information to 
each data set before each data set is written onto tape. The data sets are 
written onto the tape according to the next available piece of tape for storing data. 
Because the data sets do not necessarily always have the same physical size, a 
different number of data sets are likely to be written on different physical data 
tracks. 

[0069] At some stage prior to the receipt of a target data request 
issued by a host, position information is loaded into the centralized data position 
information storage area 703 from a reserve storage area, for example, a 
cartridge memory device. Alternatively the centralized data position storage area 
703 accumulates data position information as the data is processed according to 
pipelines 701 and 704. Once data position information is stored in the storage 
area 703 it can, at some future stage, be transferred or downloaded to a suitable 
reserve storage area for future use. 

[0070] Fig. 8 includes an illustration of a portion of the centralized 
data position information storage area 703 forming part of the logical media 702. 
The storage area 703 comprises a data table configured to store logical data 
position information relating to logical record data and logical file mark data. The 
logical data position information located in the data table 703 is arranged such 
that a logical data position has a corresponding physical position on the magnetic 
tape and a physical data set number. The physical position is in terms of a 
physical track number. These physical parameters are relative to the BOW or 
EOW. The logical data position information contained within the data table 703 
relates to all data distributed along the length of tape. Alternatively, the logical 
data position information within the data table 703 corresponds to selected data 
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groups. Such data groups are spatially distributed along the length of magnetic 
tape. The benefit of storing data position information relating to selected data is 
that the storage area 703 requires less capacity within the logical media 702 for 
data position information storage. 

[0071] Fig. 9 is a flow diagram of the general mode of operation of 
apparatus of Figs. 7 and 8. Following the initial writing of data received from a 
host onto the length of magnetic tape, and the loading into the centralized data 
position information storage area of data position information, a target data 
request is received from a host during step 900. Such a target data request is 
issued, for example, by a user requiring specific data for use, for example, in an 
application. The data request of step 900 requires the tape drive, and in 
particular a read head forming part of the drive, to access the target data and 
transfer or output this data to the host. Following step 900, the operation 
continues to step 901 , during which the logical data position and the physical data 
position of this target data are determined by the logical media 702 via the 
centralised data position information storage area. Then during step 902 the 
logical media 702, having a searching utility in the form of a searching algorithm, 
determines the tape motion required to allow the read head forming part of the 
tape drive to access the target data. The tape motion determined by the logical 
media 702 is based on the current tape drive position, defined by logical 
positional parameters (records and file marks), logical data position information 
within the storage area 703, and an estimate made by the search algorithm, of 
the position of the target data. The logical media 702 calculates the required 
tape motion to, for example, change data tracks and data sets, to access a 
record or plurality of records. The accessed record or plurality of records is 
referred to as the target data. If the centralized data position information 
storage area contains information relating to substantially all the data recorded 
on the magnetic tape, the estimate of the target data position is an actual 
determination of the target position and, as such, the read head can be 
positioned over the target data requiring a single tape displacement motion. 
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However, if the logical data position information is stored in the storage area 703 
and relates to selected groups of data distributed along the length of tape, the 
target data request relates to data that does not have its position stored in the 
storage area 703. In such a scenario the read head performs localized iterative 
searching operations to find this target data in close proximity to the estimated 
position determined by logical media stage 702. Such searching is undertaken 
as the tape is translated past the read head. 

[0072] After step 902 has been completed the tape motion is initiated 
during step 903, during which the read head accesses the target data or 
estimated target data on the magnetic tape. Once the target data is found during 
step 904 the data is processed by the logical media 702 during step 905. 

[0073] Fig. 10 includes an exemplary situation wherein the target data 
has a logical position B along a hypothetical elongated track. The starting logical 
data position has a position A on the hypothetical elongated track. Fig. 10 also 
includes the actual physical position of the starting data, at position A' on the 
second elongated data track 400 of tape 706, and the actual physical position of 
the target data, at position B' on the third elongated track of tape 706. Physical 
position B' is behind physical position A' in the direction of motion of tape 706 
(i.e., position B' is closer to BOW than position A'). In contrast logical position B 
is in front of logical position A on the hypothetical track. In the example of Fig. 
10, the logical media 702 is considered to have just received a target data 
request from a host. As such, following the estimate of the logical target data 
position (B), the read head moves from starting logical data position A to this new 
target logical position B. 

[0074] In terms of the physical data position B' relative to physical 
data position A' on the magnetic tape, and the read head must traverse back 
from position A' along the second data track 400, move to the third data track 
and continue traversing back towards the Beginning of Wrap or tape (BOW) 
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arriving ultimately at position B\ The logical media 702 determines (1) the 
physical data position of the target data with reference to a data set number and 
track number and (2) the logical data position with reference to a record number 
and a filemark number. The traversing of the logical media from physical position 
A' to physical position B\ as illustrated in Fig. 10, is possible due to the data 
position information contained in the updateable centralized data position 
information storage area 703 that contains useful data position information to 
determine the physical target data position B\ 

[0075] The physical position B' of the target data is determined by the 
logical media accessing logical data stored in the storage area 703. The physical 
position B' of the target data is determined by the arrangement of logical data 
within the data table (Fig. 8) of the storage area 703. The data table includes 
columns of record numbers and file marks. The records and file marks are 
related to a physical data set on a physical data track by inclusion of the records 
and file marks in a particular row of the data table. 

[0076] Fig. 11 is a schematic diagram illustrating how selected logical 
data position information relating to selected and partitioned logical data are 
distributed and stored along the length of magnetic tape, according to a further 
embodiment of the present invention. The centralized data position information 
storage area 703, according to the further embodiment, stores data position 
information relating to the percentage of total partitioned logical data distributed 
along the length of the tape. Because the centralized storage area 703 has an 
incomplete pseudo map of the logical data on tape, area 703 requires less 
capacity for storing such information . Accordingly the capacity of the logical 
media can be utilized for additional applications. The tape illustrated in Fig. 1 1 
includes a centralized storage area 703 having a data table (as illustrated in Fig. 
8) that stores data position information relating to logical data distributed along 
the length of the tape. The logical data are stored in selected data groups 1 100, 
that are spaced from each other along the lengths of tracks 400, each of which 
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extends parallel to the edges of the tape. Each of data groups 1100 is 
effectively, stripes of data as illustrated in Fig. 11. Logical data position 
information relating to data not within the selected data group 1 1 00 is not stored 
in the storage area 703. 

[0077] The logical media 702, when requested to access target data 
not falling within a selected data group 1 100, and whose data position information 
is not stored within the storage area 703, performs an effective local search to 
access such data using the data position information of data within a nearest 
selected data group 1100. In such a scenario the logical media 702 determines 
the position of data having data position information stored in the storage area 
702, such information relating to data within a selected data group 1100. The 
read head is then positioned over such data (within a selected data group). The 
logical media then scans the tape in a first tape motion direction, to locate the 
target data. It will be appreciated, by those in the art that the furthest physical 
displacement of the tape during any particular localized search, is the 
displacement from one selected data group to the next nearest neighbor 
selected data group 1100. According to another embodiment the distance 
between selected data groups 1100, along the length of tape, is small so as to 
reduce the time taken for localized data searching. 

[0078] Conversely, and to minimize the amount of information stored 
within the storage area 703, data position information stored within the storage 
area 703 relates to relatively narrow selected data groups 1 100 distributed along 
the length of tape. Such relatively narrow selected data groups 1100 are 
separated from each other by relatively large distances. Formatting of data of 
relatively narrow selected data groups 1100 distances generally results in an 
increased time period for target data to be located during a local search. 

[0079] Storage area 703 is configured with logical data position 
information related to substantially all the partitioned (in terms of data sets) logical 
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data distributed along the length of tape. The partitioned logical data is not 
divided into selected data groups 1 100 as illustrated in Fig. 1 1 . The logical media 
702 having the effective pseudo map of the partitioned logical data distributed 
along the length of tape identifies the logical and physical position of target data, 
without the need to resort to localized searching being conducted between 
neighboring selected data groups 1 100 as identified above. 

[0080] Fig. 12a is a flow diagram detailing the initial stages of an 
initialization procedure following which the centralized data position information 
storage area 703 contains data position information relating to logical data 
distributed along the length of tape. A length of tape is loaded onto a spool or 
reel within the storage system during step 1200. Then during step 1201 the 
logical media 702 performs a check procedure to determine if data position 
information is stored within a reserve storage area. Such a reserve storage area 
is any of (1 ) a suitable cartridge memory (CM) connected to or forming part of the 
storage system, (2) a suitable storage area located on the tape, (3) a further 
storage area positioned within the storage system or (4) a further storage area 
that is a separate entity connected by suitable connecting means. If data position 
information is found within the storage area, the data position information is (1) 
read by the logical media during step 1202 and (2) subsequently written to the 
storage area 703 during step 1203. Following the loading of data position 
information into the storage area 703 during steps 1202 and 1203, the data table 
forming the storage area 703 contains information relating to the position of 
logical data distributed along the length of tape. The data table is an effective 
pseudo map of the logical data that the magnetic tape stores. The logical media 
is configured to possibly delete data within the reserve storage area (step 1204). 
Such data deletion during step 1204 is a preparatory stage in the event of a 
subsequent information download from the storage area 703 to the reserve 
storage area. Conversely, the data position information within the reserve 
storage area can be preserved during step 1205. Such data position 
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information is thereby available for future utilization by the logical media during 
steps 1202 and 1203. 

[0081] If during step 1201, it is found that no data exists in the 
reserve storage area 703, data position information relating to data distributed 
along the length of tape is loaded into storage area 703 from a separate location. 
Such a procedure is performed by the steps in the flow chart of Fig. 12b. In the 
event of an initial blank reserve storage area 703, data position information is 
loaded from a suitable CM during step 1206. During step 1207, the logical 
media 702 is also configured to receive data position information loaded from the 
directories 401 located on the tape at, for example, the BOW or EOW. Such 
directories contain data position information relating to the logical data distributed 
along the length of the tape as illustrated in Fig. 5. 

[0082] Following the loading of data position information into storage 
area 703, forming part of the logical media 702, the storage system, during step 
1208, receives a command from a host to locate and provide access to target 
data. Using a suitable search algorithm, as detailed with reference to Figs. 13a - 
d, the logical media 702 utilizes the data position information within the storage 
area 703 to process target data requests during step 1209. Hence, during step 
1209 target data is transferred from the storage system to a host. In operation, 
the storage system is configured to perform multiple target data requests utilizing 
data position information within the logical media. A shut down command is then 
issued by the host during step 1210, causing data position information to be 
then downloaded to the reserve storage area during step 1211. Data position 
information in this reserve storage area can then be used for subsequent 
initializations of the logical media as detailed in Fig. 12a. Because the logical 
media 702 is preferably a dynamic random access memory (DRAM), unloading 
the tape from the storage system during step 1212, does not result in the logical 
media retaining information relating to a previously loaded tape. 
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[0083] Fig. 12c is a flow chart of the steps following the loading of 
data position information from the reserve storage area to the storage area 703 
within the logical media 702. The storage system having the logical media 
configured with the data table comprising data position information relating to 
logical data distributed along the length of tape, is now configured to receive a 
target data command from a host during step 1208. Following the receipt of 
such a target data command the logical media 702, as detailed above, conducts 
target data processing during step 1209. Following the undertaking of a single or 
multiple target accessing operations, the host issues a shut down command 
during step 1210. The shut down command causes a loss of power to the 
logical media 702, step 1216. If the logical media, and in particular, the storage 
area 703 is a volatile memory (DRAM), data position information can be 
downloaded from this storage area to a suitable reserve storage area in order to 
preserve the data position information. Accordingly, data position information 
within the storage area 703 is written to the reserve storage area during stage 
1214. 

[0084] The transfer of data position information during step 1215 is 
appropriate if the logical media 702 is configured to update the storage area 703 
with data position information relating to data that were updated on the length of 
tape. In this arrangement, the logical media 702 updates the storage area 703 
with data position information during a target data access operation following 
receipt of a target data command issued by a host. In the event of a centralized 
storage area 703 having incomplete data position information relating to logical 
data stored on tape, the logical media 702 is configured to dynamically build the 
storage area with data position information each time the tape is displaced and a 
data read operation undertaken. The updating of the storage area 703 is 
possible though the utilization of a suitable update algorithm or algorithms that 
are operative to (1 ) determine a logical data position and physical data position 
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with reference to a known position and (2) store this new data position 
information within the centralized storage area 703. 

[0085] According to an alternative procedure in response to a shut 
down command from the host (step 210) data position information within storage 
area 703, is not written to the reserve storage area during step 1214; the not 
writing step is shown by step 1215 on the flow region of Fig. 12c. By not writing 
data to the reserve storage area, problems associated with corrupted data within 
the centralized storage area 703 are avoided. In such an event, the corrupted 
data within the storage area 703 following the unloading of the tape during step 
1212 are lost such that the storage area 703 can then be loaded with 
uncorrupted data position information from a suitable reserve storage area. 

[0086] Fig. 13a is a flow diagram detailing the initial stages of data 
retrieval by the logical media 702 from a magnetic tape data storage medium. 
Initially, the tape is loaded into the storage system during step 1200. A target 
data request is then received from a host during step 1208. The logical media 
702 then, during step 1300, checks the centralized data position information 
storage area 703 for complete data position information relating to substantially 
all the partitioned logical data distributed along the length of tape. 

[0087] If the centralized data position information storage area 703 
contains data position information relating to selected data groups distributed 
along the length of tape (an incomplete storage area), the logical media 703 
performs at least one localized search to determine the target data position 
based on data position information relating to data positioned in close proximity 
to the target data on the magnetic tape. In the event of an incomplete 
centralized storage area 703, the logical media 702 establishes the current 
position during step 1301 . The current position is represented in terms of logical 
data position parameters (record numbers and file mark numbers) and physical 
data position parameters (data set numbers and track numbers). The 
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determination of the current position is conducted by engaging the read head 
forming part of the tape drive. The logical media then, during step 1302, 
estimates the target data position based on the current position parameters and 
any useful data position information within the storage area 703. The useful data 
position information relates to data located in close proximity to the target data 
position on the magnetic tape. Then, during step 1303 the estimated target data 
position is compared with the current position. If the estimated target data 
position is greater than the current position, the system advances to step 1304, 
Fig. 13b. In step 1304 the logical media, via the tape drive, changes its current 
position within a particular data set and on a particular data track to a current 
position + N, where N is an integer dependent upon the physical layout of data 
tracks on the magnetic tape. The formatting of data tracks on the tape takes the 
form of a series of parallel tracks running the length of the tape or alternatively a 
squashed square spiral in which data is first written onto the centre of the tape 
such that subsequently written data extends out from this central position in a 
square spiral. For example, the estimated target data position may be a factor of 
10 greater than the current data position on the tape. If so, the tape drive via the 
logical media 702 displaces the tape (in terms of data sets and data tracks, by 
the appropriate parameter N). Following the change of position of step 1 304 the 
new data position of the logical media 702 on the tape is read during step 1305. 
This new position is read in terms of physical track and physical data set number 
and logical record and logical file mark number. Steps 1302 and 1303 are then 
repeated in order to determine if the new physical data position is the desired 
target data position. 

[0088] If the estimated target data position is found during step 1103 
to be not greater than the current position, the logical media 702 advances to 
step 1306 (Fig. 13c) to determine if the estimated target data position is less than 
the current position. If the estimated target data position is found to be less than 
the current position, the logical media 702 determines (during step 1307) that the 
tape drive should change position from its current logical and physical data 
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position to this current position - N. Following such a change in position, the new 
data position is then (during step 1305) determined by the read head and logical 
media 702, in terms of track number, data set number, record number and file 
mark number. After step 1305 operation returns to step 1302 and a number of 
the above steps are then repeated to determine if the target data position has 
been located. If it is determined during step 1306 that the estimated target data 
position is not less than the current data position, then the logical media 702 
determines (during step 904) that the target data has been found, such that at 
this position the read head can then access the target data during step 905. 

[0089] Fig. 13d includes the procedure following the determination 
during step 1300 (Fig. 13a) that a complete data position information storage 
area has been located. If the storage area 703 contains data position 
information relating to substantially all the partitioned logical data distributed 
along the length of tape, the logical media 702 establishes the current position 
during step 1301; steps 1301 of Figs. 13a and 13d are the same. The logical 
media 702 then, during step 902, utilizes the data position information within the 
storage area 703 to determine the required tape motion to locate the target data. 
When the read head is located over the target data, the target data are read and 
transferred to the host during step 905. 

[0090] As will be appreciated by those skilled in the art, the specific 
and further embodiments of the present invention are configured for operation 
when utilized in a storage system in which the tape drive is displaceable relative 
to the magnetic tape medium. Such a system, utilizing a static tape with 
displaceable tape drive is operative to locate target data, following a request 
from a host, due to the availability of data that are transported past the read 
head of the tape drive. 

[0091] The transporting of logical data past the read head is common 
to both operational methods of the storage system described herein, in which the 
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tape drive is static and the magnetic tape is displaceable, and a system in which 
the tape drive is displaceable and the magnetic tape is static. 



